An Attempt To Computerized Dictionary Data Bases

نویسندگان

  • Makoto Nagao
  • Jun'ichi Tsujii
  • Yoshihiro Ueda
  • M. Takiyama
چکیده

Two dictionary data base systems developed at Kyoto University are presented in this paper. One is the system for a Japanese dictionary ( Shinmeikai Kokugojiten, published by Sansei-do) and the other is for an English-Japanese dictionary (New Concise English-Japanese Dictionary, also published by Sansei-do). Both are medium size dictionaries which contain about 60,000 lexical items. The topics discussed in this paper are divided into two sub-topics. The first topic is about data translation problem of large, unformatted linguistic data. Up to now, no serious attempts have been made to this problem, though several systems have been proposed to translate data in a certain format into another. A universal data translator/verifier, called DTV, has been developed and used for data translation of the two dictionaries. The detailed construction of DTV will be given. The other sub-topic is about the problem of data organization which is appropriate for dictionaries. It is emphasized that the distinction between 'external structures' and 'internal structures' is important in a dictionary system. Though the external structures can be easily managed by general DBMS's, the internal (or linguistic) structures cannot be well manipulated. Some additional, linguistic oriented operations should be incorprated in dictionary data base systems with universal DBMS operations. Some examples of applications of the dictionary systems will also be given.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Creating And Querying Lexical Data Bases

Users of computerized dictionaries require powerful and flexible tools for analyzing and manipulating the information in them. This paper discusses a system for grammatically describing and parsing entries from machine-readable dictionary tapes and a lexicai data base representation for storing the dictionary information. It also describes a language for querying, formatting, and maintaining di...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

An English Dictionary for Computerized Syntactic and Semantic Processing Systems

R. F. SIMMONS (1970) and M. PA~AK and A. W. PRATT (1971) point out that no computerized system using natural language either as part of the processor or as the object processed and having a syntactico-semantic component has a lexicon of more than a few hundred items (except for the SNOV' s medical lexicon). It is obvious from the lack • of success of large-scale computerized systems using natur...

متن کامل

Multilingual Lexical Knowledge Bases: Applied WordNet Prospects

The idea of a Lexical knowledge base was recently proposed by the ESPRIT BRA AQUILEX [Briscoe 91], [Calzolari 92] project, to provide information, mostly of a semantic nature, internally consistently structured and electronically available. Three levels of lexical representation are proposed in AQUILEX: (a) Machine Readable Dictionary (MRD), i.e. an electronic version of the paper dictionary; (...

متن کامل

Desirable Characteristics of Information Resource Dictionary Systems

It is now widely accepted that an organization’s information is one of its most valuable resources. The management of this resource, especially the organization’s computerized data bases, has become a critical function for its continued survival and viability. Data administration is the function that assists the organization in the management and control of data. This role has therefore achieve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1980